# Japanese speech recognition

Japanese Hubert Base Phoneme Ctc

A CTC model for Japanese phoneme recognition, fine-tuned from rinna/japanese-hubert-base and intended to improve Japanese speech recognition accuracy.

Speech Recognition

Transformers Japanese
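
For readers unfamiliar with CTC, the following minimal sketch shows greedy CTC decoding: pick one label per frame, collapse consecutive repeats, then drop the blank label. The vocabulary, the blank index of 0, and the frame sequence are hypothetical and chosen only for illustration; the actual model's vocabulary and blank index may differ.

```python
from itertools import groupby


def ctc_greedy_decode(frame_ids, id_to_token, blank_id=0):
    """Collapse consecutive repeated labels, then remove blanks."""
    collapsed = [label for label, _ in groupby(frame_ids)]
    return [id_to_token[i] for i in collapsed if i != blank_id]


# Hypothetical phoneme vocabulary (not taken from the model card).
vocab = {0: "<blank>", 1: "k", 2: "o", 3: "N", 4: "n",
         5: "i", 6: "ch", 7: "w", 8: "a"}

# Hypothetical per-frame argmax labels, as a CTC model might emit them.
frames = [1, 1, 0, 2, 2, 3, 0, 4, 4, 5, 0, 6, 5, 5, 0, 7, 8]

phonemes = ctc_greedy_decode(frames, vocab)
print(" ".join(phonemes))  # k o N n i ch i w a
```

A blank between two identical labels keeps both of them, which is how CTC represents a genuinely repeated phoneme.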

Kotoba Whisper V2.2

A Whisper-based Japanese automatic speech recognition model that adds speaker diarization and punctuation restoration.

Speech Recognition

Transformers Japanese

Kotoba Whisper V2.0

Kotoba-Whisper is a distilled Japanese automatic speech recognition model developed by Asahi Ushio in collaboration with Kotoba Technologies. It is distilled from Whisper large-v3 and runs inference 6.3x faster.

Speech Recognition

Transformers Japanese

Japanese Wav2vec2 Base Rs35kh

A wav2vec 2.0 Base model fine-tuned on the large-scale Japanese automatic speech recognition corpus ReazonSpeech v2.0, suitable for Japanese automatic speech recognition tasks.

Speech Recognition

Transformers Japanese

reazon-research

Parakeet Tdt Ctc 0.6b Ja

Parakeet TDT-CTC 0.6B is an automatic speech recognition (ASR) model capable of transcribing Japanese speech with punctuation, developed by the NVIDIA NeMo team.

Speech Recognition Japanese

Kotoba Whisper V1.1

Kotoba-Whisper-v1.1 is a Japanese automatic speech recognition model based on Whisper, with added punctuation and timestamp processing capabilities.

Speech Recognition

Transformers Japanese

Wav2vec2 Base Japanese Asr

A speech recognition model fine-tuned from rinna/japanese-wav2vec2-base on the Japanese subset of common_voice_11_0. It outputs hiragana only.

Speech Recognition

Transformers Japanese
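
Because this model outputs hiragana only, reference transcripts that contain katakana may need normalizing before they are compared with its output. The sketch below is one assumed approach, not part of the model: it shifts katakana code points to their hiragana counterparts. The function name is hypothetical, and kanji are left untouched because converting them requires a reading dictionary.

```python
def katakana_to_hiragana(text: str) -> str:
    """Map katakana (U+30A1..U+30F6) to hiragana (U+3041..U+3096).

    The two Unicode blocks are laid out in parallel, 0x60 apart.
    Other characters, including kanji and the long-vowel mark, pass through.
    """
    return "".join(
        chr(ord(ch) - 0x60) if "\u30a1" <= ch <= "\u30f6" else ch
        for ch in text
    )


normalized = katakana_to_hiragana("コンニチハ")
print(normalized)  # こんにちは
```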

Kotoba Whisper V1.0

Kotoba-Whisper is a collection of distilled Whisper models for Japanese automatic speech recognition, developed jointly by Asahi Ushio and Kotoba Technologies. It is 6.3 times faster than the original large-v3 while maintaining a similarly low error rate.

Speech Recognition

Transformers Japanese

Whisper Small Japanese

A Japanese speech recognition model fine-tuned from openai/whisper-small for Japanese speech-to-text.

Speech Recognition

Transformers Japanese

Wav2vec2 Large Xlsr 53 Japanese

A Japanese speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53. It expects audio input sampled at 16 kHz.

Speech Recognition

Transformers Japanese
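
Audio recorded at another rate has to be resampled to 16 kHz before it is passed to a model like this one. The sketch below uses plain linear interpolation purely to illustrate the idea; the function name is hypothetical, and because it applies no anti-aliasing filter, a dedicated audio library's resampler is the better choice in practice.

```python
def resample_linear(samples, src_rate, dst_rate=16000):
    """Resample a mono signal by linear interpolation (illustrative only)."""
    if src_rate == dst_rate or not samples:
        return list(samples)
    n_out = int(len(samples) * dst_rate / src_rate)
    out = []
    for i in range(n_out):
        pos = i * src_rate / dst_rate        # position in the source signal
        lo = int(pos)                        # sample at or before pos
        hi = min(lo + 1, len(samples) - 1)   # next sample, clamped at the end
        frac = pos - lo
        out.append(samples[lo] * (1 - frac) + samples[hi] * frac)
    return out


one_second_48k = [0.0] * 48000               # one second of silence at 48 kHz
resampled = resample_linear(one_second_48k, 48000)
print(len(resampled))  # 16000
```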

Whisper Large V2 Mix Jp

An automatic speech recognition (ASR) model fine-tuned from OpenAI Whisper-large-v2 on Japanese speech datasets.

Speech Recognition

Kan Bayashi Csj Asr Train Asr Transformer Raw Char Sp Valid.acc.ave

A Japanese automatic speech recognition (ASR) model with a Transformer architecture, trained on the CSJ dataset using the ESPnet framework.

Speech Recognition Japanese

W2v Hf Commonvoice From Xlsr53 Pretrain 0329UTC1500

A speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Common Voice Japanese dataset.

Speech Recognition

Wav2vec2 Large Xlsr Japanese Hiragana

A Japanese speech recognition model fine-tuned from facebook/wav2vec2-large-xlsr-53 that outputs hiragana.

Speech Recognition

Transformers Japanese

Wav2vec2 Large Xlsr Japanese 0325 1200

An automatic speech recognition (ASR) model fine-tuned from facebook/wav2vec2-large-xlsr-53 for Japanese speech recognition.

Speech Recognition

Transformers Japanese
